Percolation-like Scaling Exponents for Minimal 
Paths and Trees in the Stochastic Mean Field 

Model 

David J. Aldous 
Department of Statistics 
367 Evans Hall # 3860 
U.C. Berkeley CA 94720 
aldous@stat.berkeley.edu 

February 2, 2008 
Abstract 

In the mean field (or random link) model there are n points and 
inter-point distances are independent random variables. For < £ < oo 
and in the n — ► oo limit, let S(£) = l/nx (maximum number of steps 
in a path whose average step- length is < t). The function S(£) is 
analogous to the percolation function in percolation theory: there is 
a critical value t* = e _1 at which S(-) becomes non-zero, and (pre- 
sumably) a scaling exponent (3 in the sense S(£) x (£ — Recently 
developed probabilistic methodology (in some sense a rephrasing of 
the cavity method developed in the 1980s by Mezard and Parisi) pro- 
vides a simple albeit non-rigorous way of writing down such functions 
in terms of solutions of fixed-point equations for probability distribu- 
tions. Solving numerically gives convincing evidence that = 3. A 
parallel study with trees and connected edge-sets in place of paths gives 
scaling exponent 2, while the analog for classical percolation has scal- 
ing exponent 1. The new exponents coincide with those recently found 
in a different context (comparing optimal and near-optimal solutions of 
the mean- field TSP and MST problems), and reinforce the suggestion 
that scaling exponents determine universality classes for optimization 
problems on random points. 

Key words and phrases. Combinatorial optimization, mean field model, 
percolation, probabilistic analysis of algorithms, scaling exponent, 



1 Introduction 



1.1 Paths 

Consider n points with inter-point distances (d(v,w) = d(w,v), 1 < v,w < 
n). A path tt = (vq,Vi, . . . ,v m ) visits a set of points, distinct except that 
maybe v m = Vq. Associated with a path tt is its length (number of steps) 
len(7r) and the average step-distance A(ir): 

len(7r) = m 

m 

A(tt) = m _1 ^d(ut_i,Ui). 
i=l 

The celebrated Traveling Salesman Problem (TSP) concerns minimizing 
A(tt) subject to len(7r) = n. One can also consider, for given m < n, the 
question of the minimum value of A(tt) subject to len(7r) > m. This has also 
been studied as an algorithmic question but instead we take a "statisti- 
cal physics" viewpoint of studying the values min„- A(n) under a probability 
model for random points. The most natural probability model is n indepen- 
dent uniform random points in the unit square, and study of the TSP in this 
model goes back 45 years to Beardwood et al jH] . See Steele [2^ f° r a recent 
survey of the general area. Unfortunately the kind of questions we study 
seem far out of reach of analytic methods in this two-dimensional model. 
Instead we use a more tractable model with several names (we say stochas- 
tic mean-field (SMF n ) but also called random link or complete graph with 
random (exponentially distributed) edge-lengths) which we imagine roughly 
as random points in m/imte-dimensional space. Section [2] provides details 
of the SMF n model. In the mid 1980s Mezard and Parisi |16j studied the 
TSP (and other optimization problems |17l I19j ) in the SMF n model, using 
the non-rigorous cavity method from statistical physics: see ^B] for a recent 
survey of the cavity method. Recent work of the author [IJ ^ |S] develops 
a methodology based on (additive) renormalization within an infinite-point 
random model of distance. This methodology, in some sense just a rephras- 
ing of the cavity method, provides a consistent framework for a wide variety 
of different calculations for different optimization problems in the context 
of SMF n . 

In this paper we study a deterministic function (e(5),0 < S < 1) arising 
as the limit 

e(6) = lim£'min{^(7r) : len(7r) > 5n, tt a path in SMF n }. (1) 
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(Limits asserted here and later are presumed, but not rigorously proved, 
to exist - see section IT31 ) The value e(l) 2.04 (obtained by numerically 
solving a fixed-point equation) goes back to Mezard and Parisi JB]; while the 
value e(0+) = e" 1 » 0.368 is given in Aldous [3] Proposition 7 (other aspects 
of paths are treated by Janson |T3|). Our purpose is to show how the recent 
methodology enables one to determine numerically the whole function e(S). 
A plot of the whole function is given in Figure 1 (left side). Of particular 
interest is the scaling behavior as S J. 0. The numerical evidence (right side 
of Figure 1, and Table 1) strongly suggests a scaling exponent 

e(5) - e(0+) x 5 a with a = 1/3. (2) 

This kind of scaling exponent is precisely analogous to scaling exponents 
around the critical value in percolation theory, as explained in section ll~3l 

1.2 Trees 

There are parallel questions using trees in place of paths. Consider a com- 
plete graph on n vertices whose edges e have lengths d(e). For any tree t in 
the graph, with edges e\, . . . , e m , write size(t) for the number of edges of t 
and A(t) for the average edge-length: 

size(t) = m 
A(t) = m-^die). 

est 

The Minimum Spanning Tree (MST) problem asks for the minimum of A(t) 
subject to size(t) = n — 1. Take n random points in our stochastic mean 
field model SMF n . Analogously to the results for paths, we anticipate a 
deterministic function (e*(5),0 < S < 1) arising as the limit 

e*{5) = lim^min{^(t) : size(t) > Sn, t a tree in SMF n }. (3) 

A well known result of Frieze for the MST says that e*(l) = C(3) ~ 
1.202, whereas Aldous [S] argued e*(0+) « 0.263 by numerics with fixed 
point equations. Parallel to the study of paths, our methodology tells how 
in principle to determine numerically the whole function £*(5). In prac- 
tice we have not be able to carry this through (see section but instead 
have analyzed the following related question. Instead of trees we consider 
connected edge-sets e = (e±, . . . , e m ). 
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1 1 1 1 1 I 1 1 1 1 — 

0.2 0.4 0.6 0.8 1 0.005 0.01 0.015 0.02 

Figure 1. The limit function for paths. On the left is the function 
e(S) defined at Q). The horizontal axis is <5, the vertical axis is e. The right side 
gives a close-up of the behavior for small 5: the points + are the values estimated 
numerically in Table 1, and the curve is e — e _1 = 0.308 S 1 ^ 3 . 




1 1 1 1 1 — 1 1 1 1 — 

0.2 0.4 0.6 0.8 1 0.005 0.01 0.015 0.02 

Figure 2. The limit function for connected edge-sets. On the left 
is the function e(5) defined at (0J ■ The right side gives a close-up of the behavior 
for small 5: the points + are the values estimated numerically in Table 1, and the 
curve is e - 0.265 = 0.360 S 1/2 . 
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Define size(e) and A(e) as above, and as at we anticipate a deter- 
ministic function (e(5),0 < 5 < oo) arising as the limit 

i(5) = \im E min{A(e) : size(e) > 5n, e a connected edge-set in SMF„}. 

(4) 

A plot of the whole function is given in Figure 2 (left side). Again, scaling 
as 5 I is of interest. The numerical evidence (right side of Figure 2, and 
Table 1) gives an estimate e(0+) ps 0.265 and strongly suggests a scaling 
exponent 

i(5) - e(0+) x 5 a * with a* = 1/2. (5) 

As explained in section [51 we must have the same 5 [ behavior for the 
"tree" function £*(■) as for the "connected edge-set" function e(-). 
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Table 1. Scaling behavior near the critical point, for e(S) (left side) and e(i5) 
(right side). In each case the function is defined implicitly via functions e(A) and 
S(\), as explained below Q). See section RTTl for discussion of the ± sampling error. 

1.3 The analogy with percolation functions 

Instead of the functions e(5) and e(5) at (11141) . we could equivalently study 
their inverse functions 5{t) and 5{t) whose interpretations are 

5(1) = lim^max{n _1 len(vr) : A(ir) < £, tt a path in SMF„}. 

5(1) = lim.Emax{n~ 1 #e : A(e) < £, e a connected edge-set in SMF n }. 

Of course the scaling exponent for trees at (JSJ) can be rewritten as 
5(1) -(£- with /3 tree = 2 
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for t > I* = e(0+). Similarly the scaling exponent for paths at @ can be 
rewritten as 

6(t)~(l-£*f with/3 path = 3 

for I > = e(0+) = e~ . To make the analogy with percolation, for 
< t < oo consider the maximal size connected edge-subset perc n (t) such 
that 

max d(e) < t. 

eGperc n (t) 

So perc n (t) is the largest percolation cluster, that is the largest connected 
component of the subgraph of SMF n consisting of edges of length < t. Well 
known theory concerning giant components in the random graph process 
implies 

limn -1 £'#perc n (i) = p{t) 

where p(t) has the properties 

pit) = 0, < t < 1; p(t) ~ 2(t - 1) as t [ 1, 

Thus the scaling exponent for ordinary percolation in SMF ra is /3perc = 1- 
Note we can rewrite p(-) as 

p{i) = limE'maxIn -1 #e : max<i(e) < £, e a connected edge-set in SMF n }. 

This differs from 5(£) only in the use of max eGe d(e) in place of ave eee ^(e)- 
So we have a rather precise analogy between our function and the usual 
percolation function. 

1.4 The big picture 

This paper provides some pieces of a big picture. Time is not yet ripe for 
a complete survey, but let us provide some glimpses of other pieces. Our 
main results here are the scaling exponents /?tree = ^ P-p&th. = ^ near * ne 
"percolative critical values" e*(0+), e(0+). In Aldous and Percus we 
study a different notion of "scaling exponent" dealing with behavior near 
the "spanning constants", i.e. near the MST and TSP constants e*(l), ^(l)- 
These exponents are based on comparing near-optimal solutions to the op- 
timal solution, and turn out to take the values 2 and 3. These values hold 
in the SMF ra model by the methodology used here, and there is evidence 
(from Monte Carlo simulations) they hold for random points in real d > 2 
dimensional space. That the "percolative" scaling exponents in this paper 
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coincide with the "spanning" exponents of is remarkable, and reinforces 
the idea put forward in that these scaling exponents provide a natural 
way of defining "universality classes" of optimization problems on random 
points. A natural next project is to study via Monte Carlo these percola- 
tive scaling exponents for random points in d > 2 dimensions, although 
this seems algorithmically difficult. At the time of writing, the only one of 
the four exponents we understand non-computationally is the tree/spanning 
exponent "2", which is easily explained using the greedy algorithm for 
finding the MST. See section IFPl for further remarks. 

1.5 Methodology 

Here is our methodology, in brief. 

• The stochastic mean field model for n points has a n — > oo limit, the 
PWIT (section 01. 

• Introducing Lagrange multipliers turns the constrained maximization 
problem into an unconstrained maximization problem. One can for- 
mulate the corresponding maximization problem for the PWIT, and 
define random variables (X, Y) measuring the relative effect on the 
maximized value of including or excluding a reference edge in the so- 
lution. 

• The recursive structure of the PWIT enables one to write down equa- 
tions (|ll|12j) satisfied by (X, Y), which can be numerically solved. 
The limit optimal values of length and ^4(-) are determined from the 
definitions of (X, Y). 

The arguments are not mathematically rigorous, for two main reasons. First, 
the central idea of identifying limits of solutions of finite-n optimization 
problems with solutions of infinite-n optimization problems requires justifi- 
cation, which has been given only in the case (related to but slightly different 
from those considered here) of mean-field minimal matching [1] and the less 
closely related case of some random graph problems |12j . Second, the scal- 
ing exponents are found by numerically solving equations with a parameter 
and examining numerical behavior as the parameter goes to a limit, and this 
falls short of analyzing the parameter-limit behavior rigorously. 
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2 The stochastic mean field model and its infinite- 
point limit 

For fixed n, the SMF n model is defined as follows. There are n points. For 
each of the pairs of points, there is a "link" whose length is random 
with exponential (mean n) distributions, these random lengths being inde- 
pendent. The distance between two points is then the length of the shortest 
path of links between them. The assumption of exponential distribution is 
convenient but not essential; results are unchanged if the link lengths are 
nL where L > has a density with /z,(0+) = 1. 

The scaling of link lengths is set up so that, as n — * oo, the mean distance 
from a typical point to its nearest neighbor converges to 1. But much more 
is true, as we now outline briefly (see [5] for detailed survey). There is 
an infinite-point model, the PWIT, defined as follows. There is a root 0. 
The root has an infinite number of links to points labeled (1, 2, 3, . . .), and 
these link lengths < £f < ^ < • • • are the successive points of a Poisson 
process of rate 1 on (0, oo). Recursively, each point i has an infinite number 
of further links to points (il, i2, £3, . . .) whose lengths < £| < Q < . . . 
are independent copies of the Poisson process. The PWIT is illustrated 
in Figure 3, and the web site JO] enables one to explore its structure via 
genuine simulations. 

The PWIT is the n — > oo limit of SMF n in a precise sense called local 
weak convergence 6 . Choose a random point of SMF n to be a root. Then as 
n — ► oo, for any fixed "window size" r the configuration of points in SMF n 
within a window of radius r centered at the root converges in distribution 
to the configuration of points in the PWIT within a window of radius r 
centered at the root. 

Two properties of the PWIT enter into our calculations later. 

(a) . For each "child" i linked to the root, there is a subtree Tj consisting 
of i and its descendants. The recursive structure of the PWIT, built into 
the definition, says that the subtrees Tj are independent as i varies and are 
distributed as the PWIT itself. 

(b) . The fact that we choose a (uniform) random vertex of SMF n to be 
the root leads to a stationarity property of the PWIT. Roughly, this says 
that the root is a "typical" vertex of the PWIT and therefore, by the er- 
godic principle, we can compute averages over all vertices of the PWIT by 
computing expectations at the root. As a more explicit instance, given a 
random vertex subset A n of SMF n , suppose we have joint local weak con- 
vergence of (SMF n , A n ) to (PWIT, Aqo) for a random vertex subset Aoo of 
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the PWIT. Then nT x E#A n -> P(root G AxO, where # denotes cardinality. 
Note that here A n is dependent on SMF n , but the root of SMF n is then 
chosen independently of A n . 



Figure 3. The PWIT. Illustration of the vertices of the PWIT within a 
window of radius 3 centered on the root 0. Lines indicate the links, but are drawn 
only when both end-vertices are within the window. Thus the four links at shown 
are at distances < £f < < £3 < £4 < 3 from 0, while there are an infinite 
number of links at of lengths greater than 3. Orientation of lines in pictures is 
arbitrary. 



3 The recursive distributional equation: the path 



By introducing a Lagrange multiplier A > 0, the finite-n problem of mini- 
mizing A(ir) subject to len(7r) can be reformulated as 




case 



maximize 



X ]enM_ A(7r) 



subject to 



7r a path in SMF, 
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This has a random solution 7r n (A). We expect that as n — > oo 



n' 1 £len(vr n (A)) 
EA(tt ti (X)) 



6(X) 
e(A) 



(6) 
(7) 



and that the function e(5) at is determined implicitly via the two func- 
tions S(X),e(\). 

To set up the analogous optimization problem on the PWIT, we first de- 
fine what will be seen to be sets of feasible solutions. Write 7r = {tt\, tt2, • • •) 
for a family of vertex-disjoint doubly-infinite paths in the PWIT. Define 

• £ o is the set of such families for which no path goes through the root; 

• £-2 is the set of such families for which some path goes through the 



• £\ is the set of such families, where in addition to the doubly-infinite 
paths there exists exactly one singly-infinite path, and this path starts 
at the root. 

Note the subscript indicates degree of root in the family. For 7r = (ir u ) G 
£$ U E\ U £2 consider the objective function 

b(iv) = A x #{v : v a vertex of some ir u } — ^ £ e 



with the convention that, for 7T E £ 1, the root vertex counts as 1/2. (Recall 
that £ e is the length of edge e in the PWIT.) In the limit procedure which 
takes SMF n to the PWIT, the limits of "paths of length order n" is exactly 
the set £q U £2 of families of doubly-infinite paths. Thus the optimization 
problem on the PWIT can be written symbolically as 



We seek to study the tv that attains the maximum. But we can't work di- 
rectly with definition ©, because b(iv) is the difference of two sums, each 
sum having value +00. Instead we can consider differences between max- 
imized b(-) values. Specifically, given a realization of the PWIT we define 
realizations of two random variables via 



root; 



e:e 



edge of some tt„ 



maximize b{ir) over 7r £ £ q U £2- 



(8) 




max b(7r) 



max 6(71"). 

7T<e£o 



(10) 



(9) 
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To see why such definitions are useful, note that the solution 7T to l|8|) will 
have a path through the root if and only if 

max b(ir) > max 6(7r), 
7re£ 2 ne£o 

that is if and only if Z > 0. 

We now set up the recursion that determines the joint distribution of 
(X, Z). We remark that X is introduced only because it arises in the recur- 
sion for Z - it would obviously be preferable to find a recursion involving 
only a single quantity like Z, but that seems impossible to find. Figure 4 
may be helpful in visualizing the argument below. 

By the recursive structure of the PWIT, for each subtree (Tj, i = 
1,2,3,...) defined by the children of the root, the random pairs (Xj,Zj) 
defined as at 1)9110(1 on Tj are distributed as (X, Z) and are independent as 
i varies. We will first show 

X = max(A - & + - Zf) (11) 

i 

where Z + = max(0, Z) and where £j are the edge-lengths at the root. 

Consider the families 7Ti and ttq attaining the maxima over £\ and £ o 
in the definition @ of X. So n\ contains an edge from the root to child 
i, say. On the subtrees (Tj, j / i) the maximal families must be identical, 
so we only need compare tt\ and 7Vq on the root-edges and the subtree Tj. 
There is a contribution A — £j to b(-) from the edge (root,i). In the subtree 
Tj, we have 

Xi = max b(n) — max b(iv) 

Zf = max b(ir) — max &(7r). 

7Te£ a(i)uf o(i) TTe^oCi) 

The family tt\ contains the first-term maximizing family 7r G £i(i) in this 
definition of AQ, while the family 7To contains the first-term maximizing 
family 7r G ^2^) U <^o(0 i n this definition of Zf. So the contribution to 
6(7Ti) from Tj equals Xj — Zf. This establishes pip, since we can choose 
the maximizing value of i to be the edge at the root. 

A similar argument leads to a recursion for Z. A family 7T2 containing 
a path through the root must contain two edges (root,i) and (root,j), say. 
The contribution to &(•), relative to using no edges at the root, of using 
(root, i) equals A — & + AQ — ZJ . Hence we get 

Z = max(A - & + X* - Z+) + max! 2 ] (A - £ + Xj - Zf) (12) 
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where maxj ' denotes second maximum. Equations (|11I12[) together give a 
formula for (X, Z) in terms of (Xj, Z{), i > 1 and (£j, i > 1). By the recursive 
structure of the PWIT, the (Xj, Zi),i > 1 are independent copies of (X, Z). 
Thus (|llll2j) constitute a recursive distributional equation (RDE) for the 
"unknown" joint distribution (X, Z). 




Figure 4. On the realization of the PWIT from Figure 3, the left side illus- 
trates the optimal 7r S £2 which does pass through the root (which happens to use 
the edges from the root to 1 and to 3), and the right side illustrates the optimal 
it 6 £q which does not pass through the root. These path- families coincide on the 
subtrees of children except {1,3}. On the subtree T 3 , the optimal family on the 
right side has a path through the root 3, whereas on the subtree Ti it does not. 

We next show how the desired quantities 5(X) and e(A) at (|6|7j) can be 
obtained from the distribution of (X, Z). The quantity (5(A) represents the 
proportion of vertices in the optimal solution to (JHJ). By the stationarity 
property of the root of the PWIT, 5(A) is just the probability that the 
optimal family contains a path through the root. As observed above, this 
happens if and only if Z > 0, so 

<5(A) = P{Z > 0). (13) 

When Z > 0, the lengths of the two edges in the path at the root are and 
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£j, where in the notation of Q12j) 



I = arg max(A - & + Xj — Z+) 

J = argmax [2] (A-& + X i -Z+). 

£ 

Again by stationarity, the mean edge-lengths over all edges in the optimal 
family equals the mean edge-length in the edges at the root in the optimal 
family, conditioned on the root being used, and so 



E 



(^) l(z>o) 



e(A) = s(xj (14) 

As mentioned before, equations (|11I12|) together form a recursive distri- 
butional equation (RDE) for the joint distribution of (X, Z). Such RDEs are 
pervasive not only in problems within SMF n but also in many other areas of 
applied probability: see jSj for a survey. They rarely allow explicit solutions, 
but there is a standard bootstrap Monte Carlo method ([2] section 8.1) which 
is very easy to implement and which gives, in principle, arbitrarily-accurate 
approximate solutions of RDEs. This method was used to solve the RDE 
for (X,Z) and then estimate <5(A) and e(A) via (|13I14|) . Numerical values 
were shown in Table 1 and Figure 1. 



4 The connected edge-set case 

The conceptual ideas behind the analysis of e(5) at (J3J) are very similar to the 
analysis of e(5) in the previous section, so we will only detail the differences. 

Consider a forest f = (ti,t2, •••) in the PWIT, each of whose tree- 
components tj is infinite. Define 
T is the set of such forests f ; 

J~o is the set of such forests such that the root is not in any component; 
J-\ is the set of such forests such that the root is in some component; 
J- 2 is the set of such forests, where in addition to the infinite tree-components 
we allow the tree-component containing the root to be either empty, or finite, 
or infinite. 

In the limit procedure which takes SMF n to the PWIT, the limits of "con- 
nected edge-sets of size order n" is exactly the set T of forests whose tree- 
components are all infinite. For f = (tj) € J- 2 D T = Tq U J-%, consider 

6(f) = A x #{e : e an edge of some tj} — £ e . 

e:e edge of some t* 
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Y 


= max 6(f) 

f€.F 


— max 6(f) 


Z 


= max 6(f) 
fe.Fi 


— max 6(f) 


X 


= max 6(f) 
feT 2 


— max 6(f) 



The optimization problem on the PWIT is 

maximize 6(f) over f G T. (15) 

To study this we define 

(16) 
(17) 
(18) 

Because J 7 2 D T = U T\ we have 

X > Y = Z + . 
The recursion for X, analogous to (fTTj) . is 

X = Y,^-ii + x i~ Y i) + - (19) 

The argument is the same as for (|ll(l : the contribution to 6(-) by using edge 
(root, i), as compared to not using it, equals (A — & + Xj — Yj), and we may 
use any number, or zero, such edges. The recursion for Z is 



Z = max ( £(A - & + ^ - *•) + ]T(A + >i) + ) (20) 

where / denotes a non-empty subset of {1,2,3,...}. Here the first sum 
represents the contribution from the set / of children i such that, in the 
optimal f G T\, in the subtree Tj the root i is in an infinite component. 
The set / must be non-empty in order for f G T\. Now the fact Xi > Z 
implies 

\ — £i + Zi — Yi < (\ — & + Xi — Y) + 

which implies there is an optimal / with only one element, and we can 
rearrange (|20j) to become 

Z = X + max ((A - & + Z< - Y) - (A - & + X< - Y)+) . 

i 

Finally, since Y = Zf we obtain the following RDE for the joint distribution 
of (X, Z). 

i 

Z = X + max((A-& + Z i -Z+)-(A-& + A^-Z+)+) . (22) 
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We next show how the desired quantities 5(X) and e(A) can be obtained 
from (X, Z) . Consider the optimal f in Q15|) . This f contains the root if and 
only if f E J-\, that is if and only if Z > 0, so 



(5(A) = P(Z > 0). 



(23) 



When Z > 0, the set I of edges at the root used in f is the set of i for which 
the contribution (A — & + X; L — Zf) is strictly positive, plus (if distinct) the 
maximizing i in (|22j) . This leads to 



for 1 as above. 

5 Trees 

Studying trees t in order to study the limit function e*(5) at © is a little 
more subtle. What are the feasible solutions on the PWIT corresponding to 
the limits of trees in SMF„? At first sight they are just the set T of forests 
f = (tj) in section^] But this is wrong; instead, by analogy with many other 
examples of limits of infinite trees [2J El the relevant feasible solutions are 
forests f = (tj) with the extra property that each of whose tree-components 
tj have one end; that is, from each vertex of tj there is exactly one infinite 
path in tj. 

To mimic the analysis of the previous section with this family of forests, 
it turns out we need, in place of Ti before, the family defined as 

Ti is the set of such forests, modified so that the tree-component con- 
taining the root may be either empty or finite, but not infinite. 

But now the analog of X at (|18|) cannot be represented recursively, since 
(roughly speaking) there is no recursive criterion for finiteness. Instead we 
need to consider, separately for m = 0, 1, 2, . . ., a definition such as 

T *( m ) is the set of such forests, modified so that the tree-component 
containing the root has exactly m edges. 

Defining X m in terms of a maximum over leads to a RDE for the 

infinite family (Xq, X\, X2, ■ ■ ■ , Z). But we have not attempted to solve this 
numerically. 

Fortunately, this detailed analysis is unnecessary for investigating the 
scaling exponent because 



e(A) 



E (fEiexCi) l (z>o) 



(24) 



~8(\) 



e*(8) = i(6) when s(5) < e 



1 
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To outline the argument, consider the minimizing edge-set e for e(S) in 
this range. Suppose e contains a cycle of length order n. By the fact (for 
paths) e(0+) = e _1 , this cycle has average edge-length > e _1 and hence has 
some edge of length > e _1 . Removing this edge would reduce A(e) without 
essentially affecting the constraint on len(e), contradicting minimality. So 
e can have no cycles of length order n. As for short cycles, fix a < e~ 1 and 
consider a typical point v of SMF n . By the arguments of El (comparison 
with the Yule process), 

P(v in any cycle c with A(c) < a) — > as n — > oo 

and it follows that the contribution to A(e) from short cycles — > as n — > oo. 



6 Final remarks 

6.1 Sampling errors in Table 1 

We treat the case of trees; the case of paths could be treated similarly. 
To obtain the numerical values in Table 1, we represented the distribution 
(X, Z) via 10 6 points and iterated the RDE 1000 times, truncating the 
Poisson process (£j, 1 < i < oo) at i = 20. This necessitated, for each value 
of A, a total of 2 x 10 10 calls to the random number generator. We calculated 
e and 5 using the final 200 generations, that is using 2 x 10 8 points. There 
are various possible errors in this way of estimating scaling exponents, of 
which the only one which can be quantified is "sampling error". Clearly 



s.d. (estimate of 5) » 5 1/2 /V2 x 10 8 » 0.7 x 10~ 4 5 1/2 

which is negligible. But the error for e is not negligible, since it is based on 
only a proportion 5 of the samples, giving 

s.d. (estimate of e) ~ 



V2 x 10 8 5 

where s.d.(£) w 0.3 is the s.d. of the ^-values used to estimate e via l|24|). 
This leads to 

s.d. (estimate of e/5 1/2 ) «2x 10~ 5 S' 1 
which are the db values shown in Table 1. 
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6.2 Rigorous bounds on scaling exponents 

Because the limit e(0+) = e _1 in the paths setting is essentially just a first 
moment calculation, a referee suggests that similar first moment methods 
should establish rigorously some bound on e(5) and hence some bound of 
the scaling exponent in the paths case. We concur, but have not attempted 
a detailed calculation. 

6.3 Scope of scaling exponents 

It seems difficult to specify precise the range of settings in which a definition 
of percolation-like scaling exponent makes sense and is interesting. Within 
the stochastic mean field model there is a well studied minimum matching 
problem (see 120] for recent proofs of the Parisi conjecture) in which 
context one could define 

, „ length min matching of some Sn vertices 

match ^ _ ^ E 5 6 

But here it is clear that 

e™ tch (5) ~ 5 as 5 j 

so that the critical value equals and the scaling exponent equals 1. How- 
ever, since the critical value equals zero we are inclined to regard this case 
as "not percolation-like". A referee suggests the example (again, within the 
stochastic mean field model) of the path through 5n points chosen greedily 
by choosing the shortest available edge at each successive vertex, but this 
also seems "not percolation-like". 

Acknowledgement. I thank two anonymous referees for helpful com- 
ments. 
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